SOME SCHEMES RELATED TO THE COMMUTING VARIETY 
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ABSTRACT. The commuting variety is the pairs of n x n matrices (X, Y) such that XY = YX. 
We introduce the diagonal commutator scheme, {(X, Y) : XY — YX is diagonal}, which we 
prove to be a reduced complete intersection, one component of which is the commuting 
variety. (We conjecture there to be only one other component.) 

The diagonal commutator scheme has a flat degeneration to the scheme {(X, Y) : XY 
lower triangular, YX upper triangular}, which is again a reduced complete intersection, 
this time with n! components (one for each permutation). The degrees of these components 
give interesting invariants of permutations. 
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1. Statements of results 

The commuting variety of a reductive Lie algebra a is defined as the reduced sub- 
scheme of g © a cut out by the equations [X, Y] = 0. It is known to be irreducible |Ri|, but 
even for a = gl n it is not presently known whether these equations serve to define it as a 
scheme (i.e. whether the ideal generated by these dim a equations is radical). 

Let h be a chosen Cartan subalgebra of a (we will work over C in this paper, so any two 
Cartan subalgebras are conjugate). We introduce the diagonal commutator scheme 

D = {(X,Y)e fl ffia:[X,Y]ef)}. 

This is now defined by only dimg/f) equations, rather than the dim a (and who knows 
how many more) equations needed to define the commuting variety. 

While our first theorem could be stated for general a, we only prove it in the case q = 
gl n , with f) the diagonal matrices (hence the name). 

Theorem 1. The diagonal commutator scheme D is a reduced complete intersection in al n © gl n , 
one component of which is the commuting variety. 

Date: February 8, 2008. 
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It is a well-known open problem, due to Artin and Hochster, to show that the commut- 
ing variety of gl n is Cohen-Macaulay. (Our reference is chapter 9 of |VJ.) The theorem 
above doesn't address that directly, but implies something related: 

Corollary. The commuting variety is Cohen-Macaulay if and only if the union of the other com- 
ponents is Cohen-Macaulay. 

In fact we conjecture that for any reductive g, the diagonal commutator scheme has 
exactly two components - i.e. only one component other than the commuting one. 

Proof. These two schemes are "directly linked," and then the theory of linkage (|E|, theo- 
rem 21.23) relates the Cohen-Macaulayness of one component to the union of the rest. □ 

We will prove this by studying a certain flat degeneration of the gl n diagonal commu- 
tator scheme. Let p : C x — > GL n (C) denote the one-parameter subgroup 

/ 1 \ 

t 

p(t) = t 2 

V t- 1 / 

of diagonal matrices. Define D z to be the subscheme of gl n © gl n 

D z = {(X',Y') : X'Y' = ptzJY'X'ptz- 1 ) off the diagonal} 

which is plainly isomorphic to D 1 = D under the map (X', Y') i— » (p(z) X, Yp(z -1 )). 

The flat family {D z }, z £ C x has a unique extension to a flat family over C, with special 
fiber D°. This sort of flat limit - by rescaling some of the coordinates of the ambient vector 
space - is called a Grobner degeneration. We know from |KS[ that such a limit must be again 
equidimensional (up to embedded components). But in our instance much more is true: 

Theorem 2. The flat limit D° := lim z _^o D z of this family is again a reduced complete intersection. 
Moreover, it is defined by the limiting equations on D z , 

{ (X, Y) : XY lower triangular, YX upper triangular} . 

Ordinarily more equations are needed in such a limit; we give a typical example. Con- 
sider the z — > limit of the equations X = 0, X = zY in the X, Y plane. For each nonzero 
z, these two equations describe two lines intersecting at the origin. But in the limit z = 
the two lines are equal and the condition Y = is lost; the correct limit is only obtained if 
that equation Y = is added to the list. The second conclusion of the above theorem says 
that this unfortunate phenomenon doesn't occur in our case: our list of dim g/1) equations 
is already enough for this limit. 

(For readers familiar with Grobner bases: this list of equations is not a Grobner basis, 
but is "Grobner enough" for this limit defined by a partial term order.) 

We have a better handle on the components of this scheme D°, because of the large 
group 

B x B + = { (L, U) G GL n (C) 2 : L lower triangular, U upper triangular} 
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acting on it by the rule 

(L,U) • (X',Y') = (LX'U^.UY'L- 1 ). 

This group is of dimension n 2 + n, slightly larger than the GL n (C) acting on the commut- 
ing variety. Since this group is connected, it preserves (and acts on) each component of 
D°. 

For 7tannxTi permutation matrix, define D° as the closure 

D°:=(B_xB + )- |J (Ttt.sTr- 1 ). 

t.seH 

Theorem 3. The components o/D° are exactly the {D°}, n e S n . 

Inside the flat family {D z } zeC , consider the component whose generic fiber is the commuting 
variety as a subfamily. The special fiber of this subfamily is D°, plus possibly some nonreduced 
structure. 

In the rest of the paper we prove these statements, in reverse order. Theorem|2]builds on 
the first half of theorem |3j whose proof uses simple facts about matrix Schubert varieties 
(our reference is |MS|), and we prove them together in subsection 12.11 Theorem [T] is then 
a consequence of theorem 

We close with a number of conjectures, generalizing the standard ones about the com- 
muting variety. 

I am pleased to thank Mark Haiman for introducing me to the commuting variety, and 
many useful conversations since. I am also grateful to David Eisenbud, Ezra Miller, and 
especially Terry Tao for their insights. 

2. A Grobner degeneration using p 

Consider the scheme of pairs of matrices 

E := { (X, Y) : XY, YX both upper triangular} 
which we'll call the upper-upper scheme. 
Proposition 1. Recall the scheme 

D° := lim {(X', Y') : X'Y' = ptzjY'X'ptz" 1 ) off the diagonal] 

z— > * * 

defined as the flat limit of this one-parameter family. 
This scheme embeds in E, via the map 

t : (X, Y) e D° i } (w X, Yw ) e E 
where w is the permutation matrix with 1 s along the antidiagonal. 

Proof. Consider the equations defining D z . When we conjugate Y'X' by p[z), it multiplies 
the upper triangle by positive powers of z, the lower triangle by negative powers (and 
leaves the diagonal alone, though we don't care). So the upper triangle of X'Y' is equal 
to the upper triangle of Y'X' times positive powers of t, and after rescaling, the lower 
triangle of Y'X' is equal to the lower triangle of X'Y' times positive powers of z. In the 
limit, we get the equations X'Y' lower triangular, Y'X' upper triangular on D°. Therefore 
(woX'.Y'wo) G E. ' ' □ 
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We will eventually prove in theorem |2] that this map is an isomorphism. We've twisted 
this scheme by w only because it's less confusing to deal always with upper triangular 
matrices, rather than to mix upper and lower. 

2.1. Dimensions of the components. The upper-upper scheme E carries an obvious ac- 
tion of pairs of upper triangular matrices, B + x B + : 

(U!,U 2 ) ■ (X,Y) := (U, XUj 1 , U 2 YU7 1 ) 

The projection p : (X, Y) t— > X is B + xB + -equivariant with respect to the action (Lii , U 2 ) -X := 
Ui X Uj 1 on the space of single matrices. 

We know the orbits of B + x B + on the space of matrices: each orbit contains a unique 
partial permutation matrix, a 0, 1 -matrix with at most one 1 in any row and column. 

We will need also a slightly more specific fact, that each orbit of N + x N + contains a 
unique monomial matrix, which has at most one nonzero entry in each row and column. 

Our reference for facts about these orbits, in particular their dimensions, is |MS[: 

Proposition 2 (|MS[, theorem 15.28). Let nbe a partial permutation matrix. The dimension 
o/B + 7rB + inside M n (C) is the number of matrix entries such that a 1 entry in n is either on it, 
directly below, or directly to the left. 

Given a partial permutation matrix n, let 

E 7t :=p- 1 ((B + x B+) -7t) 

so the {EtJ give a finite decomposition of E = Li^E^ into B + x B + -invariant locally closed 
subsets. 

Lemma 1. The stratum E n is smooth and irreducible, of dimension n 2 + [rank of it). 
If it is a permutation matrix (not just partial), then the set 

( N + x N + ) • { ( 7tt , S7r~ 1 ) , s,t invertible diagonal } 

is an open dense subset of~E n . 

Proof. The fiber over the "central" point n e E^ is 

{Y : n Y, Yn both upper triangular] 

which is a vector space. Since B + x B + acts transitively on vl^n), E^ is a vector bundle 
over the smooth irreducible p(E n ), and therefore smooth and irreducible. Moreover, the 
dimension of E^ is the dimension of the orbit in the base (given by proposition |2j), plus 
the dimension of the fiber. 

So let's compute the fiber dimension. The conditions 7tY, Yn upper triangular become, 
on matrix entries, that Yy must be zero if there is a 1 in n directly (and strictly) to the left, 
or directly (and strictly) below, entry ij. Otherwise Yy is free, which includes the case 
when there is a 1 in n actually in entry ij. 

Every matrix entry therefore "counts" for the dimension of the base (by proposition |2|l 
if it has a 1 in n below or to the left, counts for the fiber if it doesn't, and counts for both if 
it is actually placed at a 1 in n. So the total count is n 2 plus the number of 1 s, as was to be 
shown. 
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Consider now the N + xN + orbit of the point [nt, sn^), and assume s invertible, and 
that s 1 1 has no repeated entries. (Even excluding those ( s , t) it turns out that we'll still get 
a dense open set.) Plainly, this orbit is contained in E^by its definition. The infinitesimal 
stabilizer of [nt, S7t _1 ] consists of those pairs (A, B) of strictly upper-triangular matrices 
such that 

A7rt-7rtB = 0, BS7IT 1 - sn^A = 0. 

So 

n^An = tBt 1 = s^Bs 

making B commute with the generic diagonal matrix s H. Therefore B is diagonal, hence 
zero, and so too is A. 

Since the N + x N + -stabilizer of [nt, S7t _1 ) is trivial, its orbit is 2(™) dimensional. No two 
of these orbits intersect (see the comment before the lemma about monomial matrices), so 
we have a 2n-dimensional family of them, in all comprising n 2 + n dimensions. This is 
the same dimension as E^, so this subset is open (hence dense, since E^ is irreducible). □ 

The following lemma tells us some (but not all) of the equations separating the compo- 
nents of E. 

Lemma 2. Let nbe a permutation matrix. Then 

(X, Y) G E n => diag(XY) = tt • diag(YX), 
i.e. (XY) U = (YX i = 1 . . .n. 

Proof. It's enough to test this equality on the dense subset given us by lemmalll consisting 
of elements of the form (X, Y) = (UTTttLLj 1 , LI^stt -1 !!^ 1 ), where Ui , U2 G N + and s, t are 
diagonal. 

XY = U 1 7rts7t- 1 U7 1 
YX = U 2 stU2 1 

So their diagonals are the same as those of 7tts7r 1 and st. □ 

We give a precise conjecture of the equations defining the closure of E^ in section |3l 
Note that the obvious component of E, in which both X and Y are themselves upper 
triangular, is E lr whereas the component that interests us most is E Wo . 

Proofs of theorems^and^ The scheme E is defined by n 2 — n equations, and by lemma|T]is 
a finite union of pieces {E^} each of codimension > n 2 — n. So it is a complete intersection. 

Therefore it is pure, and only those pieces E^ of codimension exactly n 2 — n are com- 
ponents (the others lie in the closure). These are the E n for which n has rank n, i.e. is a 
permutation matrix and not just a partial permutation matrix. In particular this proves 
the first statement in theorem |3l 

It remains to show that E is reduced. Since it is a complete intersection and therefore 
Cohen-Macaulay, being generically reduced implies that it is reduced (see [E], exercise 
18.9). We will now find a smooth, reduced point [nt, S7t _1 ) on each E^. 

Let t, s be generic diagonal matrices (the genericity condition will be specified in due 
course). The scheme E is the zero set of the map 

(X, Y) i-> strict lower triangles of XY, YX, 
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whose differential at the point (nt, sn 1 ) is 

( A, B ) n strict lower triangles of AY + XB , YA + BX 

= strict lower triangles of As7r~ 1 + 7rtB, S7t _1 A + B7rt. 

We want to show this differential is onto. 
Consider A = Ae^-qj, B = \Le\ >n [\y Then 

(A, B) i— > strict lower triangles of Ae^t^-STr -1 + [Lnte^m, S7r _1 Ae^vy + [le^mnt 
= strict lower triangles of Xs^e^^ + [it\e nWt7t[l) , As^y + [it x e m 

In particular A = 1 , \i = gives us 

(A, B) i— > strict lower triangles of Sje n ^^), Ste^. 

If i > j but 7t(i) < 7t(j), we can use this to produce pairs (0, ey). If i < j but 7r(i) > n[j), 
we can use this to produce pairs (e^i)^-), 0). 

The hard case is when i > j and 7r(i) > n{]), then (i, j) = (k, I) gives us 

(A, B) i— > strict lower triangles of (As, + (J-ti)e 7t (i) )7t Q), (Asi + (j,t j ) eij 

and as long as Sj/si ^ tj/tj for any i, j, we can adjust A, \i to obtain (0, ey) and (e^i)^), 0) 
in the image. So we've gotten every pair of matrices where one has zero strict lower 
triangle and the other has exactly one entry in the strict lower triangle. These generate 
the target so the differential is indeed onto. 

We've found a reduced point in each component E^ of E. Therefore E is generically 
reduced, hence by its Cohen-Macaulayness it's reduced. 

We're now ready to knock off theorem |2] and the remainder of theorem |3J Theorem O 
amounts to the statement that the map in proposition |T] is an isomorphism, which we'll 
now prove. 

Since D 1 and E are complete intersections defined by n 2 - u quadratics, they have the 
same degree, 2 n ~ n . This map x : (X, Y) i— > (woX, Ywq) from proposition |T] is linear, so 
preserves degree. Taking flat limits also preserves degree. So the image t(D°) inside E 
has the same degree as E. Since E is equidimensional of the same dimension as t(D°), 
we find that t(D°) must include all of E's components. But this lower bound on t(D°) is 
already E, since E is reduced. 

By |KS|, the (reduction of the) z — > limit of any component of D z is again equidimen- 
sional, hence a union of some components of E. We want to see that the commuting com- 
ponent of D limits only to Ei, and that the non-commuting components of D accounts for 
all the other components of E. A priori one might expect some components {E^} to arise 
as components of both limits, but as E is generically reduced this does not happen. 

Let t, s be generic and n ^ 1 . Now note that each point [nt, snr 1 ) is in every D z . For 
z = 1 , this point is in a non-commuting component of D = D 1 . For z = 0, this point is 
in the E^ component (and no other, by the genericity). So the limit of the non-commuting 
components of D is all the non-identity components of E. □ 

Unfortunately, the result of |KS| doesn't let us rule out the possibility that the z — > 
limit of the commuting variety has embedded components. To show this doesn't happen, 
it would be enough to know that the variety E Wo is defined by the additional equations 
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diag(XY) = wo-diag( YX) with no more needed. That would also imply that the commuting 
scheme is reduced, which remains unknown at the time of this writing. 

2.2. Some results about (multi)degrees of components. Let denote the degree of the 
homogeneous affine variety *E n . 

Proposition 3. • The sum ^ 7teSn d^ is 2 n2 ~ n . 

• Denote by * : S k x S n _ k — > S n the standard concatenation of permutations. 
Then d n * p = d^dp. 

• Ifw is the permutation of length n of maximum length, then 

dn d n d Wfl Ttwg d Wo i Wfl . 

Proof For the first statement, the right-hand side is the degree of the quadratic complete 
intersection E, which is the sum of the degrees of its components. 

For the second, note that 

F~xF~x (A kx(n - k) ) 2 -> F^ 

((XlYO^yj.MlJVW * ((* y;),( X 2 y 2 )) 

is an isomorphism, and linear so degree-preserving. 

The third is really two statements. The n <-> 7r _1 symmetry comes from the map 
(X, Y) i— > (Y, X). The map 

(X,Y) i-> (woX T w 0) w Y T wo) 
is also easily seen to give a linear isomorphism of F^ and F Wo?r i Wo . □ 



With Macaulay 2 |M2|, we computed these degrees for small n and ★-irreducible n: 

d^ = ^ 
d 2 i =3 

d 2 3i = d 312 = 13, d 32 i = 31 

So for example when n = 3, 

2 32 ~ 3 = d 12 3 + d 213 + d 132 + d 312 + d 231 + d 321 
= d 3 + d 21 d! + d! d 21 + d 312 + d 231 + d 321 
= 1+3 + 3 + 13 + 13 + 31. 



In fact one can sharpen proposition[3]a great deal using the theory of multidegrees HMSI 
(also known as equivariant multiplicities |Rol), using not only the rescaling action but the 
full torus action on D°. Since we don't want to recapitulate this theory here - except to 
say that it assigns each cycle a homogeneous polynomial, rather than just a number - we 
give only a little taste, using the 2-torus action that scales X and Y individually. 

Proposition 4. Let A, B be the usual generators of the weight lattice of the 2-torus scaling X,Y 
individually. Let d' n denote the bidegree of~E n , a homogeneous polynomial in N[A, B]. 

• The sum ^ 7teSn d; is (A + B) n2 - n . 

• Denote by ★ : x S n -k — > S n the standard concatenation of permutations. Then 

C P = d;d p (AB)^. 



• If Wo is the permutation of length n of maximum length, then d' n — d' w , . 

• d;(A,B) = d^ 1 (B,A). 

The proofs are exactly as in proposition |3j The n = 3 example now becomes 

d{ = 1 , d 21 = A 2 + AB + B 2 

d 231 = 2A 5 B+4A 4 B 2 +4A 3 B 3 +2A 2 B 4 +AB 5 , d 312 = A 5 B+2A 4 B 2 +4A 3 B 3 +4A 2 B 4 +2AB 5 
d 321 = A 6 + 3A 5 B + 7A 4 B 2 + 9A 3 B 3 + 7A 2 B 4 + 3AB 5 + B 6 



(A + B) 3 - 3 = d^ + d^ + d^+d^ + d^ + d^ 

= (AB) 3 (d;) 3 + ABd^d; + ABd;d^ + d 312 + d 231 + d 321 

= A 3 B 3 + 2(A 3 B + A 2 B 2 + AB 3 ) 

+ (2A 5 B + 4A 4 B 2 + 4A 3 B 3 + 2A 2 B 4 + AB 5 ) 

+ ( A 5 B + 2A 4 B 2 + 4A 3 B 3 + 4A 2 B 4 + 2AB 5 ) 

+ (A 6 + 3A 5 B + 7A 4 B 2 + 9A 3 B 3 + 7A 2 B 4 + 3AB 5 + B 6 ) 

3. Conjectures 

There are two main conjectures about the commuting scheme: that it is reduced, and 
that it is Cohen-Macaulay. We state some conjectures sharpening these two. 

Conjecture. The variety E^ o/E is defined as a scheme by three sets of equations: 

(1) those defining E, which say XY, YX upper triangular 

(2) those given by lemma^ that diag(XY) = n ■ diag(YX) 

(3) those defining the n, 7r _1 matrix Schubert varieties: for each pair i, j the rank of the lower- 
left i x j rectangle in X (resp. in Y) is bounded above by the number of 1 s in that rectangle 
in n (resp. in n^). 

Note that for n = w , the third set is empty. 

Moreover, if we impose just the first and third set of equations, we get the reduced scheme 

U P <7T^P- 

If this is proved for n = w , it implies that the commuting scheme (to which E Wo de- 
forms) is reduced, i.e. is the commuting variety. 



Conjecture. Each individual {E n }, and each union U P <7t^p / zs Cohen-Macaulay. 

Note that these statements are trivial for the component E 1r being a linear subspace 
{(X, Y) : both upper triangular}. Perhaps they can be proved by induction in the Bruhat 
order. 

We repeat our earlier-stated conjecture (which doesn't seem to imply anything directly 
about the commuting scheme): 

Conjecture. For g a reductive Lie algebra, the diagonal commutator scheme of g is a reduced 
complete intersection with two components. 



In the gl n case, Terry Tao conjectured in a discussion the equations defining the other 
component. First, we find some equations that do in fact hold. 

Proposition 5. Consider the n x 2n matrix, whose first n columns are the diagonals of X 1 , 
i = . . . n — 1 , and next n are the diagonals of Y\ i = . . . n — 1 . 

If (X, Y) G H but [X, Y] 7^ 0, then the rank of this nx2n matrix is at most n — 1 . In particular 
every size n minor vanishes. 

Proof. Let K = [X, Y]. Then the nonzero diagonal matrix K is trace-perpendicular to any 
element Zx in the centralizer Cx of X: 

Tr(KZx) =Tr([X,Y]Z x ) =Tr([Z x ,X]Y) =TrO = 

The same argument holds for any Zy in the centralizer Cy of Y (rotating the opposite 
direction), and any linear combination Zx + Zy. 

The functional Tr (K-) is only sensitive to the diagonal entries, and the trace form Tr (■•) 
is nondegenerate. So the projection "take diagonals" from Cx + Cy to t is not onto, since 
it only hits K- 1 . (This is where we use K ^ 0.) 

The argument so far would work fine in any semisimple g, with the Killing form in 
place of the trace form. In the gl n case, we have a bunch of matrices we know to be in 
Cx (resp. Cy), namely the powers of X (resp. Y). The non-ontoness of the projection then 
gives us the rank claim in the proposition. □ 

Note that this gives one equation each on X and Y individually - while every X com- 
mutes with some Y (e.g. or X itself), not every X has a nonzero diagonal commutator 
with some Y. 

Conjecture. The equations in proposition\5\define the other component(s) of the diagonal com- 
mutator scheme. 

With Macaulay 2, we verified this in the gl 3 case - first by finding the equations, then 
using them to suggest the conjecture. The other conjectures were also all verified in this 
case. 
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